Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

نویسندگان

چکیده

The main challenge for adaptive regulation of linear-quadratic systems is the tradeoff between identification and control. An policy needs to address both estimation unknown dynamics parameters (exploration), as well underlying system (exploitation). To this end, optimism-based methods that bias in favor optimistic approximations true parameter are employed literature. A number asymptotic results have been established, but their finite-time counterparts few, with important restrictions. This article establishes worst-case regret policies. presented high probability upper bounds optimal up logarithmic factors. nonasymptotic analysis requires following very mild assumptions: stabilizability system's dynamics, limiting degree heaviness noise distribution. establish such bounds, certain novel techniques developed comprehensively probabilistic behavior dependent random matrices heavy-tailed distributions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regret Bounds for the Adaptive Control of Linear Quadratic Systems

We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that apart from logarithmic factors its regret up to time T is O( √ T ). Unlike previous approaches that use a forced-exploration scheme, we construct a high-probability confidence set around the mode...

متن کامل

Adaptive fuzzy pole placement for stabilization of non-linear systems

A new approach for pole placement of nonlinear systems using state feedback and fuzzy system is proposed. We use a new online fuzzy training method to identify and to obtain a fuzzy model for the unknown nonlinear system using only the system input and output. Then, we linearized this identified model at each sampling time to have an approximate linear time varying system. In order to stabilize...

متن کامل

On Optimal Quadratic Regulation for Discrete-Time Switched Linear Systems

This paper studies the discrete-time linear quadratic regulation problem for switched linear systems (DLQRS) based on dynamic programming approach. The unique contribution of this paper is the analytical characterizations of both the value function and the optimal control strategies for the DLQRS problem. Based on the particular structures of these analytical expressions, an efficient algorithm...

متن کامل

Linear Quadratic Regulation using Reinforcement

In this paper we describe a possible way to make reinforcement learning more applicable in the context of industrial manufacturing processes. We achieve this by formulating the optimization task in the linear quadratic regulation framework, for which a conventional control theoretic solution exist. By rewriting the Q-learning approach into a linear least squares approximation problem, we can ma...

متن کامل

Adaptive sampled-data based linear quadratic optimal control of stochastic systems

The problem of sampled-data (SD) based adaptive linear quadratic (LQ) optimal control is considered for linear stochastic continuous-time systems with unknown parameters and disturbances. To overcome the difficulties caused by the unknown parameters and incompleteness of the state information, and to probe into the influence of sample size on system performance, a cost-biased parameter estimato...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Automatic Control

سال: 2021

ISSN: ['0018-9286', '1558-2523', '2334-3303']

DOI: https://doi.org/10.1109/tac.2020.2998952